Defines the options for extracting text from PDF files.

Full documentation

Initializes a new PDFTextExtractionOptions object.

Full documentation

Initializes a new PDFTextExtractionOptions object.

Full documentation

Gets the bounds (in display coordinates) for text extraction.

Full documentation

Gets a value indicating whether extended text information should be loaded for text. This flag is used only by com.o2sol.pdf4java.content.PDFContentExtractor.extractText , com.o2sol.pdf4java.content.PDFContentExtractor.extractText, com.o2sol.pdf4java.content.PDFContentExtractor.extractTextRuns and com.o2sol.pdf4java.content.PDFContentExtractor.extractTextRuns methods.
By default this property is true which allows to analyze text fragment positions in order to group extracted text into lines. If it is set to false then only the text is loaded and no other properties (suc as positions, font info, colors, etc).

Full documentation

Gets a value indicating whether characters that fit partially the extraction bounds should be included in the extracted text.

Full documentation

This method supports the internal library infrastructure and should not be used by external client code.

Full documentation

Gets a flag indicating whether the text extraction process should use the text included in the /ActualText entry applied to current showText operator.

Full documentation

Sets the bounds (in display coordinates) for text extraction.

Full documentation

Sets a value indicating whether extended text information should be loaded for text. This flag is used only by com.o2sol.pdf4java.content.PDFContentExtractor.extractText , com.o2sol.pdf4java.content.PDFContentExtractor.extractText, com.o2sol.pdf4java.content.PDFContentExtractor.extractTextRuns and com.o2sol.pdf4java.content.PDFContentExtractor.extractTextRuns methods.
By default this property is true which allows to analyze text fragment positions in order to group extracted text into lines. If it is set to false then only the text is loaded and no other properties (suc as positions, font info, colors, etc).

Full documentation

Sets a value indicating whether characters that fit partially the extraction bounds should be included in the extracted text.

Full documentation

This method supports the internal library infrastructure and should not be used by external client code.

Full documentation

Sets a flag indicating whether the text extraction process should use the text included in the /ActualText entry applied to current showText operator.

Full documentation